SEQUENCE LISTING r-<^ 

<J\C=> 



(1) GENERAL INFORMATION: 

(i) APPLICANT: TUCKER, KENNETH 
PLOSILA, LAURA 

(ii) TITLE OF INVENTION: MORAXELLA CATARRHAL I S OUTER MEMBRANE 
PROTEIN-106 POLYPEPTIDE, GENE SEQUENCE AND USES THEREOF 



U 



(iii) NUMBER OF SEQUENCES: 19 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PENNIE & EDMONDS 

(B) STREET: 1155 Avenue of the Americas- - - - - 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: USA 

(F) ZIP: 10036-2711 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Baldwin, Geraldine F. 

(B) REGISTRATION NUMBER: 31,232 

(C) REFERENCE /DOCKET NUMBER: 7969-060 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 790-9090 

(B) TELEFAX: (212) 869-8864 

(C) TELEX: 66141 PENNIE 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

( B) TYPE : amino acid 

( C ) STRANDEDNESS : 

( D ) TOPOLOGY : u nknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

lie Gly lie Ser Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala Arg 
15 10 15 

Glv Asp Lys Ser lie Ala lie Gly Asp lie Ala Gin Ala Leu Gly Ser 
20 25 30 
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Gin Ser lie Ala He Gly Asp Asn Lys He Val 
35 40 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 



10 „ . _ _ _ _ _ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Gly Thr Val Leu Gly Gly Lys Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 3: 

- c (i) SEQUENCE CHARACTERISTICS: 

iD (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "probe" 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

24 

GGNACNGTNC TNGGNGGNAA RAAR 
(2) INFORMATION FOR SEQ ID NO: 4: 

2 c (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

on (ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1.-72 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GAA GCG GAC GGG GGG AAA GGC GGA GCC AAT GCG CGC GGT GAT AAA TCC 48 
Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala Arg Gly Asp Lys Ser 
35 1 5 10 " 

72 

ATT GCT ATT GGT GAC ATT GCG CAA 
lie Ala He Gly Asp He Ala Gin 
20 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

5 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala Arg Gly Asp Lys Ser 
1 5 10 

He Ala He Gly Asp He Ala Gin 

10 . 20 _ _ ... „. 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : unknown 
(D) TOPOLOGY: unknown 

15 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "probe" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
2 0 YTTYTTNCCN CCNAGNACNG TNCC 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : unknown 
2 5 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " "probe" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

3 0 24 

GGNACNGTNT TRGGNGGNAA RAAR 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = ""probe"" 



24 
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24 



(v) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
5 YTTYTTNCCN CCYAANACNG TNCC 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9542 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
1Q (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CTATTGACTT AAATCACCAT ATGGTTATAA TTTAGCATAA TGGTAGGCTT TTTGTAAAAA 60 

TCACATCGCA ATATTGTTTT ACTGTTACTA CCATGCTTGA ATGACGATCC AAATCACCAG 120 

15 ATTCATTCAA GTGATGTGTT TGTATACGCA CCATTTACCC TAATTATTTC AATCAAATGC 180 

CTATGTCAGC ATGTATCATT TTTTAAGGTA AACCACCATG AATCACATCT ATAAAGTCAT 240 

CTTTAACAAA GCCACAGGCA CATTTATGGC CGTGGCGGAA TATGCCAAAT CCCACAGCAC 300 

GGGGGGGGGT AGCTGTGCTA CAGGGCAAGT TGGCAGTGTA CGCACTCTGA GCTTTGCCCG 360 

2 0 TATTGCCGCG CTCGCTGTCC TCGTGATCGG TGCGACGCTC AATGGCAGTG CTTATGCAGG 420 

TATCGGAATT AGTGAAGCAG ACGGGGGAAA AGGCGGAGCC AATGCGCGCG GTGATAAATC 480 

CATTGCTATT GGTGATATTG CTCAGGCACT TGGCTCTCAA TCTATTGCTA TCGGTGACAA 540 

CAAAATAGTT CATAATTCAA ATAATAATGC TAATATAGGT GCCAAAGCCT CAGGTAATGA 600 

GTCCATCGCC ATCGGTGGTG ATGTATTGGC TTCTGGTCAT GCCTCGATTG CCATCGGTAG 660 

25 TGATGACTTA TATTTGAAAA AGGAAACGGT ACAGCAAATC TCAGAGCTTC TACCTATTAT 720 

TCGCGGACAG AAAGCATTAA ACGATATATA CCAACTAGCT GACACTAATC TTCAAAAATA 780 
T AG ACG C AC A CACGCACAGG GACACGCCAG TACTGCAGTG GGAGCCATGT CATATGCAAA 840 
GGGTCATTTT TCCAACGCCT TTGGTACACG GGCAACAGCT GAAGGTACCT ATTCCTTGGC 900 
3 0 AGTGGGTCTT ACCGCCACAG CCAAAGCAGC ATCTTCAATC GCTGTTGGTT CTAATGCACA 960 

AGCTATCGGG TTTGCAGCGA CAGCCGTTGG TGGAAGTACT CAAGTTAATT TGAATCGAGG 1020 

TATTGCCCTA GGTTTTGGTT CTCAGGTCCT TCAGAAGGAT AATGATGTAA ATGCAGCAAA 1080 

TGTACGGGCC TATGCACCAG ATGATAACCA GCCAATAGAC AACCGGTATA AAGCCACCTT 1140 

CAAGAATGGT GCTACGGATG TATTTTCCAT TGGTAATAGT AATGGGAATG ACAGTATCAG 1200 

35 GCGTAAAATC ATCAATGTCG GTGCAGGTTC TGCGGATACC GATGCGGTCA ATGTGGCACA 1260 

GCTTAAAGAG GCGGTGAGGC TGGCTAATCG TCAAATTACT TTTAAGGGTG ATGATAGCAA 1320 
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10 



15 



20 



25 



30 



35 



TAATAGAGTA 
CGCATTAACC 
TGCTGAAACT 
AGTTACCGTA 
TGGCATTGAT 
AAAGATTACC 
TAAAAAAGTT 
TAAAAATGGT 
AGACGGTACT 
CGAAACAAAT 
CAGCACCCTA 
AGTCGGTGCT 
TGAAAATACC 
GGATGCAAAC 
CAATGTCCAA 
CCTAAGCACT 
CCTATCACAA 
GGGCGAGATT 
CATCGGTGTG 
AAGCGATCTT 
CAGTGGCAAC 
AGGTGCTACC 
TGATAACAAT 
TGCTAAGCAA 
AGTGGGTGAA 
ACTAAGCAAT 
TATCGTTGAT 
CTTTAACCTA 
TTTTATCAAT 
GGCGTATGAT 
AAACCAAATT 
AATTAACTTT 
CGACAATCTA 



GAAAAAGGTT 
GATCATAACA 
TTAACCAGCC 
GGCAAAACCC 
GAAAGCAAAC 
AAACTTACTG 
AACCAAACCG 
AATGATGCTA 
CAAGTCAACA 
GGTACAGTTA 
AACAACGATG 
GATGGCATTA 
ACTCGCATTA 
AAACCCCGCC 
TCTGCCATTA 
GCTAATACCG 
GTACCGCTGA 
TTAAAGGTTA 
GTGGCTGATA 
GATGCGGTTA 
AACACCGCTA 
CCTGCCACCA 
GGTATAGCAC 
GATGGTTCAC 
GTTGAGATTA 
ACCCTAACCG 
AGTACTGACA 
AAAAATAATG 
GGCAATGCCA 
GTCAATGTGG 
GGCGTAAAAA 
AGTGTTAACT 
AACACCCTAG 



TGGGCAAGAC 
TCGGTGTGGT 
TTAAAATGGT 
GCCTTACCAG 
CTTATCTTGA 
CTGGTGTAGT 
CTGAAAGTGC 
ATGACAGCAA 
CCCTAAAACT 
CCTTTGGGCT 
GCTTATCTGT 
CATTTACTGA 
CCAGAGACGG 
TAACCCCAAC 
ACCCTGCTAC 
AAAAATCAGG 
CCTTTGCAGG 
AAGGTGGTAA 
GTACCGATAA 
ATACTAAAAC 
AGCTACAAAA 
ACAGCAAAAC 
TTGACGGCAC 
TTGATAAAAG 
CCACCAACGG 
ATGCCACCAA 
AAACCCGTGC 
GTGACGCCAA 
CCACCGCTAA 
ATGGTACAAC 
CCACCACACT 
CTGGTGATGA 
CTGGTGAAAT 



TTTAACTATC 
ACAAAATGGC 
TACCACTGAA 
AGATAAAATT 
TAAAGACACT 
AGATGACGAT 
TCTACAAACC 
AATCATCACC 
CAAAGGTGAA 
TAACCAAAAT 
TAAAAACACC 
TATCAGCAAT 
TATTGGTTTT 
TGGCATTAAC 
CAATGGTGGG 
CTCTGCCGCC 
TGATACAGGT 
AACCACAGCT 
TAGCTTAACC 
CCTAACTGCC 
TGGTGATTTA 
CATCTATGGC 
AACTTACATC 
CAAACCTTAT 
CATTAATGCA 
CGCAACAACA 
CGCCAGCATT 
AGACTTTGTC 
AGTCACTTAT 
CATTCATCTA 
GACCAAAACA 
CAAAGCCCTT 
TCGCAACACC 



ACAGGTGGTG 
GATGGTCTGA 
AACCTAACCG 
GGTTTTACCA 
GGCATTCATG 
GCGGCAACTT 
TTTACCGTTA 
GTGGGTAAAA 
AACGGTGTTG 
AACGGTCTGA 
AATAGTAACA 
AGTAAGCCAG 
GCTAATAATA 
GCAGGTGGTA 
CAGCTAGACT 
ACCATTAAAG 
CCTAATGTCA 
GATGATTTAA 
GTTAAACTTG 
AGCGATAAAG 
ACCTTTAGCA 
GTTGATGGCT 
ACCAAAGACA 
CTTGATAAGG 
GGTGGTAAAG 
GGGCATGTAA 
GGTGATGTGC 
TCCACTTATG 
GATGGCAAAG 
ACAGGCGCTG 
GATGCTAAAG 
ATTAACGCCA 
AAAGGCACAG 



CACAGACCAG 
AAGTTCAACT 
CCAACGAGAA 
ATGATATGAA 
CAGGTGGTCA 
ATGGACAGCT 
AAAAGGTAGA 
ATAACAAACC 
ATGTTACAAC 
CCGTTGGCAA 
AACAAATCCA 
GTGCTGGCAT 
CTGGTTCATT 
AAGAGCTGAC 
TTATGAACCG 
ACT TAT AC AA 
CCAAAAAACT 
CCAAAAATAA 
CTAAAACTTT 
TTACCGTAGA 
AACAAAATAC 
TGAAGTTTAC 
AAGTTGGCTT 
ACAAGCTAAA 
CCATCACAGG 
CTCAATTGGG 
TAAACGCAGG 
ACACTGTTGA 
CCAGTAAAGT 
ATGGCAATAA 
GTGATAAAGC 
AAGACATCGC 
CAGACACCGC 



1380 

1440 
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1620 

1680 

1740 
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1860 
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1980 

2040 
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2160 

2220 
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35 



CCTACAAACC 
CACCATCACC 
TAAAAACGGT 
CCAAAGCGGT 
AAACACCGCT 
TAATAATGGT 
TGGCTTTGCT 
CATTAACGCA 
CAATGATGCT 
CAGCAGTACT 
AGGTAATAAC 
TGTCATCACC 
TGTGGGCATT 
TGGCAAAGGC 
CACTCTAGCT 
AATCAAAGAC 
TAACTTGCAA 
TGCCGATGGC 
AGTGGTCTAT 
CGTAAAAACC 
TCAAGCTACT 
ATCTGGCGAC 
GGATGCTGAT 
AAATGATGGC 
CCAAACCCCA 
AGTAAATGAT 
TGAAAAAGCC 
TGCCGTTGCC 
GGG CG AG ACT 
CATCGGTGTG 
TCTTAACAGC 
CTCAAGCGGT 
TGGCAAGGTC 



TTTCAAGTCA 
GTGGGTAAAG 
CTTGATATTC 
CTTAAAGCCG 
GGTAACGAAC 
GTTGTAGGTG 
GGGACTAATG 
GGTGGTAAAA 
GTGACAGGCG 
GCCAAAACAG 
TTTACGGTTA 
TTTGCAGGTG 
GACCAAACCA 
ATTGTCATTG 
AATGTTACCA 
GAAGACAAAA 
GGCAATGGTG 
AATGCCACCA 
GATGTCAATG 
ACC AC AT TG A 
GGCGATGCGC 
ATCCAAACTG 
GGCAATAAGG 
ACAGTTGATA 
GATGGCACAT 
GCCAATAAAA 
GCTTCTGATA 
CAAACACCGC 
TTGACCATCA 
GTAGCAGGTA 
GTTAATGCAG 
CAAGCCAAAG 
ATCAGCAATG 



AAAAAGTCAA 
ATGCAAAAAC 
AAACCAATAA 
GCAACAACAC 
AAATCCAAGT 
CTGGCATTGA 
GCTCACTTGA 
AGATTACCAA 
GCAAGATTTA 
CACAAAACTC 
GTAACCCTTA 
AAAACGGCAT 
AAGGCTTAAC 
ACAGCCAAAA 
ATGATAAAGG 
CCCGTGCCGC 
AAGCGGTTGA 
CCGCTAAGGT 
TGGATGATAC 
CCAGTACTGG 
TTGTCAAGGC 
CCAAAGGGGC 
TCATCTATGA 
AAACCAAAGA 
TGGCTCAAAT 
AGCAAGGCAT 
ACAAAACCAA 
TGACCTTTGC 
AAGGTGGGCA 
CTGATGGCTT 
GTGGTACCAA 
CAAACACCGC 
TGGGCAAAGG 



AGAAAATGGT 
CAATCAAGTC 
AGATGGTACG 
CACTCTAAAC 
CGGTGCTGAT 
TGGCACAACT 
TAAAAGCAAA 
CATTCAATCA 
TGATTTAAAA 
ATTACACGAA 
CTCCAGTTAT 
TACCACCAAG 
CACGCCTAAG 
TGGTCAAAAT 
TAGCGTACGC 
CAGCATTGTT 
CTTTGTCTCC 
GACCTATGAT 
AACCATTGAA 
CACAGGTGCT 
CAGTGATATC 
AAGCCAAGCG 
CAGTACCGAT 
AGTTGCCAAA 
GAATGTCAAA 
CAATGAAGAC 
AAACGCCGCA 
AGGGGATACA 
AACAGACACC 
CACTGTCAAA 
AATTGATGAC 
TGTGCTAAGT 
CACAAAAGAC 



GATGATGATA 
AACACCCTAA 
GTTACCTTTG 
AACAATGGCT 
GGCGTGAAGT 
CGCATTACCA 
CCCCACCTAA 
GGTGAGATTG 
ACCGAACTTG 
TTCTCAGTAG 
GACACCTCAA 
GTAAATAAAG 
CTGACCGTGG 
ACCATCACAG 
ACCACAGAAC 
GATGTGCTAA 
ACTTATGACA 
GACACAAGCA 
GTTAAAGATA 
AATAAATTTG 
GTTGCTCATC 
AACAGCTCAG 
AACAAGTACT 
GACAAACTGG 
TCAGTCATTA 
AACGCCTTTG 
GTAACTGTGG 
GGCACAACGG 
AATAAGCTAA 
CTTGCCAAAG 
AAAGGCGTGT 
GCCAATGGGC 
ACCGACGCTG 



ATGACGCTGA 
AACTCAAAGG 
GCATTAACAC 
TGTCTATTAA 
TTGCCAAGGT 
GAGATGAAAT 
GCAAAGACGG 
CCCAAAACAG 
AAAACAAAAT 
CAGATGAACA 
AGACCTCTGA 
GTGTGGTGCG 
GTAATAATAA 
GACTAAGCAA 
AGGGCAAGAT 
GCGCAGGCTT 
CTGTCAACTT 
AAACCAGTAA 
AAAAACTTGG 
CCCTAAGCAA 
TAAACACCTT 
CAGGCTATGT 
ATCAAGCCAA 
TCGCCCAAGC 
ACAAAGAACA 
TTAAAGGACT 
GTGATTTAAA 
CTAAAAAACT 
CCGATAATAA 
ACCTAACCAA 
CTTTTGTAGA 
TGGACCTGGG 
CCAATGTACA 
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ACAGTTAAAC GAAGTACGCA ACTTGTTGGG TCTTGGTAAT GCTGGTAATG ATAACGCTGA 
CGGCAATCAG GTAAACATTG CCGACATCAA AAAAGACCCA AATTCAGGTT CATCATCTAA 
CCGCACTGTC ATCAAAGCAG GCACGGTACT TGGCGGTAAA GGTAATAACG ATACCGAAAA 
ACTTGCCACT GGTGGTGTAC AAGTGGGCGT GGATAAAGAC GGCAACGCTA ACGGCGATTT 
AAGCAATGTT TGGGTCAAAA CCCAAAAAGA TGGCAGCAAA AAAGCCCTGC TCGCCACTTA 
TAACGCCGCA GGTCAGACCA ACTATTTGAC CAACAACCCC GCAGAAGCCA TTGACAGAAT 
AAATGAACAA GGTATCCGCT TCTTCCATGT CAACGATGGC AATCAAGAGC CTGTGGTACA 
AGGGCGTAAC GGCATTGACT CAAGTGCCTC AGGCAAGCAC TCAGTGGCGA TAGGTTTCCA 
GGCCAAGGCA GATGGTGAAG CCGCCGTTGC CATAGGCAGA CAAACCCAAG CAGGCAACCA 
ATCCATCGCC ATCGGTGATA ACGCACAAGC CACAGGCGAT CAATCCATCG CCATCGGTAC 
AGGCAATGTG GTAGCAGGTA AGCACTCTGG TGCCATCGGC GACCCAAGCA CTGTTAAGGC 
TGATAACAGT TACAGTGTGG GTAATAACAA CCAGTTTACC GATGCCACTC AAACCGATGT 
CTTTGGTGTG GGCAATAACA TCACCGTGAC CGAAAGTAAC TCGGTTGCCT TAGGTTCAAA 
CTCTGCCATC AGTGCAGGCA CACACGCAGG CACACAAGCC AAAAAATCTG ACGGCACAGC 
AGGTACAACC ACCACAGCAG GTGCAACCGG TACGGTTAAA GGCTTTGCTG GACAAACGGC 
GGTTGGTGCG GTCTCCGTGG GTGCCTCAGG TGCTGAACGC CGTATCCAAA ATGTGGCAGC 
AGGTGAGGTC AGTGCCACCA GCACCGATGC GGTCAATGGT AGCCAGTTGT ACAAAGCCAC 
CCAAGGCATT GCCAACGCAA CCAATGAGCT TGACCATCGT ATCCACCAAA ACGAAAATAA 
AGCCAATGCA GGGATTTCAT CAGCGATGGC GATGGCGTCC ATGCCACAAG CCTACATTCC 
TGGCAGATCC ATGGTTACCG GGGGTATTGC CACCCACAAC GGTCAAGGTG CGGTGGCAGT 
GGGACTGTCG AAGCTGTCGG ATAATGGTCA ATGGGTATTT AAAATCAATG GTTCAGCCGA 
TACCCAAGGC CATGTAGGGG CGGCAGTTGG TGCAGGTTTT CACTTTTAAG CCATAAATCG 
CAAGATTTTA CTTAAAAATC AATCTCACCA TAGTTGTATA AAACAGCATC AGCATCAGTC 
ATATTACTGA TGCTGATGTT TTTTATCACT TAAACCATTT TACCGCTCAA GTGATTATCT 
TTCACCATGA CCAAATCGCC ATTGATCATA GGTAAACTTA TTGAGTAAAT TTTATCAATG 
TAGTTGTTAG ATATGGTTAA AATTGTGCCA TTGACCAAAA AATTACCGAT TTATCCCGAA 
AATTTCTGAT TATGATCACT TTTCATAAAT TTCCCCAATT TGTCTTTATA AATATCCCAA 
GAAATGGTAT TATTTTATTG CCATCAGCAT ATGCGACAAC TCATCGTATC ATCTTTTTAT 
CATAAAAATG CAAATAGGCA TATGCATTTT TTGAATTGAA CTTACGCACT GAGAGATCCC 
CTCATAATTT CCCCAAAGCG TAACCATGTG TGAATAAATT TTGAGCTAGT AGGGTTGCAG 
CCACGAGTAA GTCTTCCCTT GTTATTGTGT AGCCAGAATG CCGCAAAACT TCCATGCCTA 
AGCGAACTGT TGAGAGTACG TTTCGATTTC TGACTGTGTT AGCCTGGAAG TGCTTGTCCC 
AACCTTGTTT CTGAGCATGA ACGCCCGCAA GCCAACATGT TAGTTGAAGC ATCAGGGCGA 
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TTAGCAGCAT GATATCAAAA CGCTCTGAGC TGCTCGTTCG GCTATGGCGT AGGCCTAGTC 
CGTAGGCAGG ACTTTTCAAG TCTCGGAAGG TTTCTTCAAT CTGCATTCGC TTCGAATAGA 
TATTAACAAG TTGTTTGGGT GTTCGAATTT CAACAGGTAA GTTAGTTGCT AGAATCCATG 
GCTCCTTTGC CGACGCTGAG TAGATTTTAG GTGACGGGTG GTGACAATGA GTCCGTGTCG 
AGCGCTGATT TTTTCGGCCT TTAGAGCGAG ATTTATACAA TAGAATTTGG CATGAGATTG 
GATTGCTTTT AGTCAGCCTC TTATAGCCTA AAGTCTTTGA GTGACTAGAT GACATATCAT 
GTAAGTTGCT GATAGGTTTC CAGTTTTCCG CTCCTAGGTC TGCATATTGT ACTTTTCCTC 
TTACTCGACT TAACCAGTAC CAACCCAGCT TCTCAACGGA TTTATACCAT GGCACTTTAA 
AGCCAGCATC ACTGACAATG AGCGGTGTGG TGTTACTCGG TAGAATGCTC GCAAGGTCGG 
CTAGAAATTG GTCATGAGCT TTCTTTGAAC ATTGCTCTGA AAGCGGGAAC GCTTTCTCAT 
AAAGAGTAAC AGAACGACCG TGTAGTGCGA CTGAAGCTCG CAATACCATA AGCCGTTTTT 
GCTCACGGAT ATCAGACCAG TCAACAAGTA CAATGGGCAT CGTATTGCCC GAACAGATAA 
AGCTAGCATG CCAACGGTAT ACAGCGAGTC GCTCTTTGTG GAGGTGACGA TTACCTAACA 
ATCGGTCGAT TCGTTTGATG TTATGTTTTG TTCTCGCTTT GGTTGGCAGG TTACGGCCAA 
GTTCGGTAAG AGTGAGAGTT TTACAGTCAA GTAAGGCGTG GCAAGCCAAC GTTAAGCTGT 
TGAGTCGTTT TAAGTGTAAT TCGGGGCAGA ATTGGTAAAG AGAGTCGTGT AAAAT AT CG A 
GTTCGCACAT TTTGTTGTCT GATTATTGAT TTTTGGCGAA ACCATTTGAT CATATGACAA 
GATGTGTATC TACCTTAACT TAATGATTTT GATAAAAATC ATTAGGGGAT TCATCAGACT 
TACGCATCTT TCATTATGGG AATTAGGTCA GTAATTATGA CAAAAAATTA TGCATTATTA 
TCCGTCTCAG ATAAAACGCA AATCGTTGAA TTTGCCCAAG GTTTGGTAGA ATCTGGCTTT 
GGTATTTTAT CCACAGGTGG TACTTTTAAA CTCTTAAAAG AACATGGGAT TGACGCCATT 
GAGGTTTCTG CCCATACAGG TTTTGCTGAA ATGATGGATG GTCGTGTTAA GACCCTACAT 
CCCAAAATTC ATGGTGGTAT TTTGGGCCGT CGTGGCATTG ATGATGCCAT TATGAATGAA 
CATGGCATTG ATCGCATTGA TATCGTTGTC GTGAATTTAT ATCCATTTGC CAACACGGTC 
GCCAAAGACG GTGTTGTTAT GTCTGATGCG ATTGAAAATA TTGATATTGG TGGGCCTGCT 
ATGGTACGCT CAGCCGCCAA AAATCATGCC CATGTTGGTA TTATCACCAG CCCAAATGAC 
TACTCACGCA TCCTAGATGA ACTAAAAAAC CAAGGTCATT TAAGCCACAA CACTCGTTTT 
GATTTGGCAG TCAAAGCATT TGAACACACT GCCGCCTATG ATGGTATGAT TGCCAGCTGG 
CTAGGTGCAC GCTTACCAGT GGATAAAGAG ACGGCACCCA GTGATGATGC CACTGCAACC 
ACTCAATTTT CACGCACTTT TAATCACCAA TTCACCAAAG CACAAGAGCT TAGATATGGC 
GAAAACCCAC ATCAGTCAGC AGCCTTTTAT GTAGATGATC ATGCAACAGA AGCGTCTGTT 
GCGACTGCAC AGCAATTACA AGGTAAAGCG TTGTCTTATA ATAATATTGC TGATACCGAT 
GCGGCACTTG AGTGTGTCAA ATCTTTTACC ACGCCTGCTT GTGTGATTGT CAAACATGCC 
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AATCCTTGTG GTGTTGCAAC ATCAGAAAAC GGTATTTTAG ATGCTTATCA CTTAGCATAT 9300 

GCAACCGATC CTGAATCTGC CTTTGGTGGC ATTATTGCCT TTAACCGAGA ATTAGACAGT 9360 

GATACAGCCC GTACCATCGT TGAGCGTCAA TTTGTTGAAG TCATCATCGC ACCAAGCATC 9420 

GCTGAAGGTG TTCTAGAGCG GCCGCGGGCC CATCGATTTT CCACCCGGGT GGGGTACCAG 9480 

5 GTAAGTGTAC CCAATTCGCC CTATAGTGAG TCGTATTACA ATTCACTGGC CGTCGTTTTA 9540 
CA 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
- 0 (A) LENGTH: 2123 amino acids 

r- ---- - .j.. B y- TYPE: amino acid _ . . . _ 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

i* Met Asn His He Tyr Lys Val He Phe Asn Lys Ala Thr Gly Thr Phe 
A 1 5 10 15 

Met Ala Val Ala Glu Tyr Ala Lys Ser His Ser Thr Gly Gly Gly Ser 

20 25 30 

Cvs Ala Thr Gly Gin Val Gly Ser Val Arg Thr Leu Ser Phe Ala Arg 

35 40 45 

He Ala Ala Leu Ala Val Leu Val He Gly Ala Thr Leu Asn Gly Ser 

50 55 60 

Ala Tyr Ala Gly He Gly He Ser Glu Ala Asp Gly Gly Lys Gly Gly 
65 70 75 80 

20 Ala Asn Ala Arg Gly Asp Lys Ser He Ala He Gly Asp He Ala Gin 

85 90 95 

Ala Leu Gly Ser Gin Ser He Ala He Gly Asp Asn Lys He Val His 

100 105 HO 

Asn Ser Asn Asn Asn Ala Asn He Gly Ala Lys Ala Ser Gly Asn Glu 

115 120 125 

Ser He Ala He Gly Gly Asp Val Leu Ala Ser Gly His Ala Ser He 
130 135 140 

or Ala He Gly Ser Asp Asp Leu Tyr Leu Lys Lys Glu Thr Val Gin Gin 
" 145 150 155 160 

He Ser Glu Leu Leu Pro He He Arg Gly Gin Lys Ala Leu Asn Asp 

165 170 175 

He Tyr Gin Leu Ala Asp Thr Asn Leu Gin Lys Tyr Arg Arg Thr His 

180 185 190 

Ala Gin Gly His Ala Ser Thr Ala Val Gly Ala Met Ser Tyr Ala Lys 

195 200 205 

Glv His Phe Ser Asn Ala Phe Gly Thr Arg Ala Thr Ala Glu Gly Thr 
on 210 215 220 

Tyr Ser Leu Ala Val Gly Leu Thr Ala Thr Ala Lys Ala Ala Ser Ser 
225 230 235 240 

He Ala Val Gly Ser Asn Ala Gin Ala He Gly Phe Ala Ala Thr Ala 

245 250 255 

Val Glv Gly Ser Thr Gin Val Asn Leu Asn Arg Gly He Ala Leu Gly 

260 265 270 

Phe Gly Ser Gin Val Leu Gin Lys Asp Asn Asp Val Asn Ala Ala Asn 

275 280 285 

val Arg Ala Tyr Ala Pro Asp Asp Asn Gin Pro He Asp Asn Arg Tyr 
" 290 295 300 

Lys Ala Thr Phe Lys Asn Gly Ala Thr Asp Val Phe Ser He Gly Asn 
305 310 315 320 

Ser Asn Gly Asn Asp Ser He Arg Arg Lys He He Asn Val Gly Ala 
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1 * 



10 



15 



325 330 r 335 

Glv Ser Ala Asp Thr Asp Ala Val Asn Valr-Ala Gin Leu Lys Glu Ala 

340 345 350 

Val Arq Leu Ala Asn Arg Gin He Thr Phe Lys Gly Asp Asp Ser Asn 

355 360 365 

Asn Arg Val Glu Lys Gly Leu Gly Lys Thr Leu Thr He Thr Gly Gly 

370 375 380 

Ala Gin Thr Ser Ala Leu Thr Asp His Asn He Gly Val Val Gin Asn 
385 390 395 400 

Glv Asp Gly Leu Lys Val Gin Leu Ala Glu Thr Leu Thr Ser Leu Lys 

405 410 415 

Met Val Thr Thr Glu Asn Leu Thr Ala Asn Glu Lys Val Thr Val Gly 

420 425 430 

Lys Thr Arq Leu Thr Thr Asp Lys He Gly Phe Thr Asn Asp Met Asn 

435 440 445 

Gly He Asp Glu Ser Lys Pro Tyr Leu Asp Lys Asp Thr Gly He His 

450 455 460 

Ala Glv Gly Gin Lys Ile.Thr Lys Leu Thr Ala Gly Val Val Asp Asp 
465 470 475 480 

Asp Ala Ala Thr Tyr Gly Gin Leu Lys Lys Val Asn Gin Thr Ala Glu 

485 490 495 

Ser Ala Leu Gin Thr Phe Thr Val Lys Lys Val Asp Lys Asn Gly Asn 

500 505 510 

Asp Ala Asn Asp Ser Lys He He Thr Val Gly Lys Asn Asn Lys Pro 

515 520 525 

Asp Gly Thr Gin Val Asn Thr Leu Lys Leu Lys Gly Glu Asn Gly Val 

530 535 540 

Asp Val Thr Thr Glu Thr Asn Gly Thr Val Thr Phe Gly Leu Asn Gin 
545 550 555 560 

Asn Asn Gly Leu Thr Val Gly Asn Ser Thr Leu Asn Asn Asp Gly Leu 

565 570 575 

Ser Val Lys Asn Thr Asn Ser Asn Lys Gin He Gin Val Gly Ala Asp 

580 585 590 

Gly He Thr Phe Thr Asp He Ser Asn Ser Lys Pro Gly Ala Gly He 

595 600 605 

Glu Asn Thr Thr Arg He Thr Arg Asp Gly He Gly Phe Ala Asn Asn 

610 615 620 

Thr Gly Ser Leu Asp Ala Asn Lys Pro Arg Leu Thr Pro Thr Gly He 
625 630 635 640 

Asn Ala Gly Gly Lys Glu Leu Thr Asn Val Gin Ser Ala He Asn Pro 

645 650 655 

Ala Thr Asn Gly Gly Gin Leu Asp Phe Met Asn Arg Leu Ser Thr Ala 

660 665 670 

Asn Thr Glu Lys Ser Gly Ser Ala Ala Thr He Lys Asp Leu Tyr Asn 

675 680 685 

Leu Ser Gin Val Pro Leu Thr Phe Ala Gly Asp Thr Gly Pro Asn Val 

690 695 700 

Thr Lys Lys Leu Gly Glu He Leu Lys Val Lys Gly Gly Lys Thr Thr 
705 710 715 720 

Ala Asp Asp Leu Thr Lys Asn Asn He Gly Val Val Ala Asp Ser Thr 

725 730 735 

Asp Asn Ser Leu Thr Val Lys Leu Ala Lys Thr Leu Ser Asp Leu Asp 

740 745 750 

Ala Val Asn Thr Lys Thr Leu Thr Ala Ser Asp Lys Val Thr Val Asp 

755 760 765 

Ser Gly Asn Asn Thr Ala Lys Leu Gin Asn Gly Asp Leu Thr Phe Ser 

770 775 780 

Lys Gin Asn Thr Gly Ala Thr Pro Ala Thr Asn Ser Lys Thr He Tyr 
785 790 795 800 

Glv Val Asp Gly Leu Lys Phe Thr Asp Asn Asn Gly He Ala Leu Asp 
805 810 815 

oc Glv Thr Thr Tyr He Thr Lys Asp Lys Val Gly Phe Ala Lys Gin Asp 
" * 820 825 830 

Gly Ser Leu Asp Lys Ser Lys Pro Tyr Leu Asp Lys Asp Lys Leu Lys 

835 840 845 

Val Gly Glu Val Glu He Thr Thr Asn Gly He Asn Ala Gly Gly Lys 
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20 



25 



30 



850 855 860 

Ala He Thr Gly Leu Ser Asn Thr Leu Thr Asp Ala Thr Asn Ala Thr 
865 870 875 880 

Thr Gly His Val Thr Gin Leu Gly He Val Asp Ser Thr Asp Lys Thr 

885 890 895 

Ara Ala Ala Ser He Gly Asp Val Leu Asn Ala Gly Phe Asn Leu Lys 
900 905 910 

c Asn Asn Gly Asp Ala Lys Asp Phe Val Ser Thr Tyr Asp Thr Val Asp 
5 915 920 925 

Phe He Asn Gly Asn Ala Thr Thr Ala Lys Val Thr Tyr Asp Gly Lys 

930 935 940 

Ala Ser Lys Val Ala Tyr Asp Val Asn Val .Asp Gly Thr Thr He His 
945 950 955 960 

Leu Thr Gly Ala Asp Gly Asn Lys Asn Gin He Gly Val Lys Thr Thr 

965 970 975 

Thr Leu Thr Lys Thr Asp Ala Lys Gly Asp Lys Ala He Asn Phe Ser 
- n 980 985 990 

val Asn Ser Gly Asp- Asp Lys Ala -Leu- He -Asn- Ala Lys .Asp lie Ala 

995 1000 1005 

Asp Asn Leu Asn Thr Leu Ala Gly Glu He Arg Asn Thr Lys Gly Thr 

1010 1015 1020 

Ala Asp Thr Ala Leu Gin Thr Phe Gin Val Lys Lys Val Lys Glu Asn 
025 1030 1035 1040 

Gly Asp Asp Asp Asn Asp Ala Asp Thr He Thr Val Gly Lys Asp Ala 
1045 1050 1055 

i c Lvs Thr Asn Gin Val Asn Thr Leu Lys Leu Lys Gly Lys Asn Gly Leu 
15 1060 1065 1070 

Asp He Gin Thr Asn Lys Asp Gly Thr Val Thr Phe Gly He Asn Thr 

1075 1080 1085 

Gin Ser Gly Leu Lys Ala Gly Asn Asn Thr Thr Leu Asn Asn Asn Gly 

1090 1095 HOO 

Leu Ser He Lys Asn Thr Ala Gly Asn Glu Gin He Gin Val Gly Ala 
105 1110 1H5 H20 

Asp Gly Val Lys Phe Ala Lys Val Asn Asn Gly Val Val Gly Ala Gly 

1125 H30 H35 

He Asp Gly Thr Thr Arg He Thr Arg Asp Glu He Gly Phe Ala Gly 

1140 H45 H50 

Thr Asn Gly Ser Leu Asp Lys Ser Lys Pro His Leu Ser Lys Asp Gly 

1155 H60 H65 

He Asn Ala Gly Gly Lys Lys He Thr Asn He Gin Ser Gly Glu He 

1170 1175 H80 

Ala Gin Asn Ser Asn Asp Ala Val Thr Gly Gly Lys He Tyr Asp Leu 
185 H90 H95 1200 

or Lvs Thr Glu Leu Glu Asn Lys He Ser Ser Thr Ala Lys Thr Ala Gin 

1205 1210 1215 

Asn Ser Leu His Glu Phe Ser Val Ala Asp Glu Gin Gly Asn Asn Phe 

1220 1225 1230 

Thr Val Ser Asn Pro Tyr Ser Ser Tyr Asp Thr Ser Lys Thr Ser Asp 

1235 1240 1245 

Val He Thr Phe Ala Gly Glu Asn Gly He Thr Thr Lys Val Asn Lys 
1250 1255 1260 



30 265 



XZoU ± zj zj 

Glv Val Val Arg Val Gly He Asp Gin Thr Lys Gly Leu Thr Thr Pro 
265 1270 1275 1280 

Lvs Leu Thr Val Gly Asn Asn Asn Gly Lys Gly He Val He Asp Ser 

1285 1290 1295 

Gin Asn Gly Gin Asn Thr He Thr Gly Leu Ser Asn Thr Leu Ala Asn 

1300 1305 1310 

Val Thr Asn Asp Lys Gly Ser Val Arg Thr Thr Glu Gin Gly Lys He 

1315 1320 1325 

He Lys Asp Glu Asp Lys Thr Arg Ala Ala Ser He Val Asp Val Leu 
1330 1335 1340 

oc Ser Ala Gly Phe Asn Leu Gin Gly Asn Gly Glu Ala Val Asp Phe Val 
35 345 1350 1355 1360 

Ser Thr Tyr Asp Thr Val Asn Phe Ala Asp Gly Asn Ala Thr Thr Ala 

1365 1370 1375 

Lys Val Thr Tyr Asp Asp Thr Ser Lys Thr Ser Lys Val Val Tyr Asp 
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1380 1385 1390 

Val Asn Val Asp Asp Thr Thr He Glu Val Lys Asp Lys Lys Leu Gly 

1395 I 400 1405 

Val Lys Thr Thr Thr Leu Thr Ser Thr Gly Thr Gly Ala Asn Lys Phe 

+ A n 1415 1420 

Ala Leu Ser Asn Gin Ala Thr Gly Asp Ala Leu Val Lys Ala s.r Asp 

1430 14Jb 
lie val Ala His Leu Asn Thr Leu Ser Gly Asp He Gin Thr Ala Lys 

1445 1450 
Gly Ala Ser Gin Ala Asn Ser Ser Ala Gly Tyr Val Asp Ala Asp Gly 

1460 1465 -±4/u 

Asn Lys Val He Tyr Asp Ser Thr Asp Asn Lys Tyr Tyr Gin Ala Lys 

1475 I 480 1 

Asn Asp Gly Thr Val Asp Lys Thr Lys Glu Val Ala Lys Asp Lys Leu 

1 aqo 1495 1500 

Val Ala Gin Ala Gin Thr Pro Asp Gly Thr Leu Ala Gin Met Asn^Val 



rnr 1510 1515 

10 -lye Ser- Val -He Asn Lys Glu Gin Val Asn-Asp Ala Asn. Lys Lys Gin 

1525 1530 J-doo 

Gly He Asn Glu Asp Asn Ala Phe Val Lys Gly Leu Glu Lys Ala Ala 

1540 1545 
Ser Asp Asn Lys Thr Lys Asn Ala Ala Val Thr Val Gly Asp Leu Asn 



Ala Val"" Gin Thr Pro Leu^Thr Phe Ala Gly Asp Thr Gly Thr Thr 

1t -n n 1575 1580 

15 Ala Lys I*- Leu Gly Glu Thr Leu Thr He Lys Gly Gly Gin Thr Asp 
coc 1590 1595 J-ouu 

Thr Asn Lys Leu Thr Asp Asn Asn He Gly Val Val Ala Gly Thr Asp 

1605 1610 
Gly Phe Thr Val Lys Leu Ala Lys Asp Leu Thr Asn Leu Asn Ser Val 

1620 I 625 , . 

Asn Ala Gly Gly Thr Lys He Asp Asp Lys Gly Val Ser Phe Val Asp 

1635 1640 xofio 

Ser Ser Gly Gin Ala Lys Ala Asn Thr Pro Val Leu Ser Ala Asn Gly 
iftcn 1655 1660 

20 Leu Asp Leu Gly Gly Lys Val He Ser Asn Val Gly Lys Gly Thr Lys 
fifit; 1670 1675 ?° 

Asp Thr Asp Ala Ala Asn Val Gin Gin Leu Asn Glu Val Arg Asn Leu 

1685 1690 j-d^d 

Leu Gly Leu Gly Asn Ala Gly Asn Asp Asn Ala Asp Gly Asn Gin Val 

1700 1705 1710 

Asn He Ala Asp He Lys Lys Rap Pro Asn Ser Glyjer Ser Ser Asn 

25 Arg Thr"" He Lys Ala Gly^nr Val Leu Gly Gly Lys Gly Asn Asn 
1 Tin 1735 1740 

Asp Thr Glu Lys Leu Ala Thr Gly Gly Val Gin Val Gly Val Asp Lye 

*~i 1750 i * 33 

Asp Gly Asn AlaAsn Gly Asp Leu Serpen Val Trp Val I^Thr Gin 

Leu Leu Ala Thr Tyr Asn 
1785 ] 

Asn Pro Ala Glu Ala He 
1800 1805 



1560 I 565 



1780 x/od 
Gin Thr Asn Tyr Leu Thr Asn Asn Pro Ala Glu Ala lie Asp Arg He 



Lys Asp Gly Ser Lys Lys Ala Leu Leu Ala Thr Tyr Asn Ala Ala Gly 
1780 1785 1790 

Tyr Leu Thr Asn Asi 
17 oc 180G 
30 Asn Glu Gin Gly He Arg Phe Phe His Val Asn Asp Gly Asn Gin Glu 
IfilO 1815 1820 

Pro Val Val Gin Gly Arg Asn Gly He Asp Ser Ser Ala Ser Gly Lys 
ooc 1830 1835 

His Ser Val Ala He Gly Phe Gin Ala Lys Ala Asp Gly Glu Ala Ala 

1845 1850 1033 

Val Ala lie Gly Arg Gin Thr Gin Ala Gly Asn Gin Ser He Ala He 
1 R60 1865 10 /u 

35 Gly Asp Asn Ala Gin Ala Thr Gly Asp Gin Ser lie Ala He Gly Thr 
J 1875 1880 I 885 

Gly Asn Val Val Ala Gly Lys His Ser Gly Ala lie Gly Asp Pro Ser 

i qqo 1895 1900 

Thr Val Lys Ala Asp Asn Ser Tyr Ser Val Gly Asn Asn Asn Gin Phe 
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1Q10 1915 1920 

Tnr Asp Ala Thr GlnThr Asp Val Phe Gly Val Gly Asn Asn lie Thr 
1925 I 930 



val Thr Glu Ser Asn Ser Val Ala^Leu Gly Ser Asn Serbia He Ser 

Ala Gly Th/nts Ala Gly Thr^ln^a Lys Lys Ser^sp Gly Thr Ala 

Thr Thr Thr Ala Gly Ala Thr Gly Thr Val Lys 
- 1975 1980 

GlyTln Thr Ala Val Gly Ala Val Ser Vainly Ala Ser Gly Alalia 

Arg Arg He Gin AsnTal Ala Ala Gly^lu Val Ser Ala Thr^er Thr 

Asp Ala val Asn 2 G?y Ser Gin Leu Tyr Lys Ala Thr Gin Gly He Ala 

2020 2025 
Asn Ala Thr Asn Glu Leu Asp His Arg He His Gin Asn Glu Asn Lys 
2035 2040 . 2045 



10 - Ala Asn Ail Gly lie Ser Ser Ala Met AUMet^a Ser.Met Pro Gin 
or\cn 2055 *UDw 

Ala'Tyr He Pro Gly Arg Ser Met Val Thr Gly Gly He Ala Thr His 

2070 2075 



2050 2055 . 2060 

L 

Asn Gly Gin Gly Ala'val Ala Val Gly Leu" Ser Lys Leu Ser Asp Asn 

2085 2090 
Gly Gin Trp Val Phe Lys He Asn Gly Ser Ala Asp Thr Gin Gly Hxs 

2100 2105 



1 c val Gly Ala Ala Val Gly Ala Gly Phe His Phe 
AD 2115 2120 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Gly Thr Val Leu Gly Gly Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 12: 

on (i) SEQUENCE CHARACTERISTICS: 

JV (A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( d ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

35 Gly He Gly He Ser Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala 
i 5 10 

Arg Gly Asp Lys Ser He Ala He Gly Asp He Ala Gin Ala Leu Gly 

20 " 
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Ser Gin Ser He Ala He Gly Asp 
35 40 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

1Q Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala Arg Gly Asp Lys Ser 

xu - -j- - 5 - ■ - - - 10 

He Ala He Gly Asp He Ala Gin 
20 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: 18 amino acids 

" (B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
20 Gly Ala Arg Gly Cys Asn Gly Ala Tyr Gly Gly Asn Gly Gly Asn Ala 
1 

Ala Arg 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 
ov . (B) TYPE: amino acid 

" (C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



30 



Tyr Thr Gly Asn Gly Cys Asp Ala Thr Arg Thr Cys Asn Cys Cys Asp 



1 5 
Ala Thr 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 
J (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

18 

GAAGCGGACG GGGGGAAA 

(2) INFORMATION FOR SEQ ID NO: 17: 

c (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
10 "TTGCGCAATG TCACCAAT ~ ----- . . _ . . 18 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

- c ( C ) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GAAGCGGACG GGGGGAAAGG CGGAGCCAAT GCGCGCGGTG ATAAATCCAT TGCTATTGGT 60 
2 0 GACATTGCGC AA 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
25 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala Arg Gly Asp Lys Ser 
F 10 15 



72 



30 



1 5 

lie Ala He Gly Asp He Ala Gin 
20 



35 
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